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Abstract — In this paper, we investigate tlie delay-throughput 
trade-offs in mobile ad-hoc networks under two-dimensional i.i.d. 
mobility models. We consider two mobility time-scales: (i) Fast 
mobility where node mobility is at the same time-scale as data 
transmissions; (ii) Slow mobility where node mobility is assumed 
to occur at a much slower time-scale than data transmissions. 
Given a delay constraint D, the main results are as follows: (1) 
For the two-dimensional i.i.d. mobility model with fast mobiles, 
the maximum throughput per source-destination (S-D) pair is 
shown to be O ^ \fDjn^ , where n is the number of mobiles. (2) For 
the two-dimensional i.i.d. mobility model with slow mobiles, the 
maximum throughput per S-D pair is shown to be O (^-^D/nJ . 
(3) For each case, we propose a joint coding-scheduling algorithm 
to achieve the optimal delay-throughput trade-offs. 



I. Notations 

The following notations are used throughout this paper, 
given non-negative functions /(«) and g{n): 

(1) /(«) — 0{g{n)) means there exist positive constants c 
and m such that /(n) < cg{n) for all n > m. 

(2) /(«) ^Q.{g{n)) means there exist positive constants c 
and m such that /(«) > cg{n) for all n > m. Namely, 
g{n) = 0{fin)). 

(3) f[n) = &{g{n)) means that both f{n) = and 
fin) =Oig{n)) hold. 

(4) f(n) = o{g{n)) means that lim„^oo/(«)/g(«) = 0. 

(5) /(«) = (o{g{n)) means that lim„^oo §(«)//(«) = 0. 
Namely, g{n) =o{f{n)). 

II. Introduction 

The throughput of a random wireless network with n static 
nodes and « random S-D pairs was studied by Gupta and 
Kumar [10]. They showed that the maximum throughput per 
S-D pair is (9(l/^/n), and proposed a scheduling scheme 
achieving a throughput of 0(l/^/«log«) per S-D pair. The 
throughput decreases with n because each successful trans- 
mission from source to destination needs to take ^Jn/ log n 

An earlier version of this paper appeared in the Proc. of WiOpt. 2007. 
Reseai'ch presented here was supported in part by a Vodafone Fellowsliip 
and NSF grant CNS 05-19691. 



hops. Later Grossglauser and Tse [9] considered mobile ad- 
hoc networks, and showed that 0(1) throughput per S-D pair 
is achievable. The idea is to deliver a packet to its destination 
only when it is within distance @{\/y/n) from the destination. 
However, packets have to tolerate large delays to achieve this 
throughput. 

We first review the results for i.i.d. mobility models. Neely 
and Modiano [15] studied the i.i.d. mobility model where the 
positions of nodes are totally reshuffled from one time slot to 
another, and showed that the mean delay of Grossglauser and 
Tse's algorithm is 0(«). In the same paper, they also proposed 
an algorithm which generates multiple copies of each data 
packet to reduce the mean delay. Since more transmissions 
are required when we generate multiple copies, the throughput 
per S-D decreases with the number of copies per data packet. 
The delay-throughput trade-off is shown to be A = Q.{D/n) in 
[15], where A is the throughput per S-D pair, and D is the 
number of time slots taken to deliver packets from source to 
destination. 

In [15], fast mobility is assumed. A different time-scale 
of mobility, slow mobility, was considered by Toumpis and 
Goldsmith in [20], and Lin and Shroff in [11]. For slow 
mobiles, node mobility is assumed to be much slower than 
data transmissions. So the packet size can be scaled down 
as n increases, and multi-hop transmissions are feasible in 
single time slot. The delay-throughput trade-off was shown 
to be A = D. (^^/ojrilogtij in [20]. A better trade-off was 
obtained in [11], where the maximum throughput per S-D pair 
for mean delay D was shown to be A = O ^\/D/«log«^ 
and a scheme was p roposed to achieve a trade-off of A 

0^yD/f„iog^/2„y 



Besides the i.i.d. mobility model, other mobility models 
have also been studied in the literature. The random walk 
model was introduced by El Gamal et al in [5], and later stud- 
ied in [6], [7] and [18]. In [6] and [7], the throu ghput pe r S-D 
pair is shown to be 0(1 / ^/nlog«) for D = 0(\/n/\ogn), and 
0(D/«)forD: 



: n(-\/«/log«), where [6] focused on the slow 
mobility and [7] focused on the fast mobility. Other mobility 
models, like Brownian motion, one dimensional mobility, and 
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hybrid random walk models have been studied in [12], [3], [8] 
and [18]. 

Although the delay-throughput trade-off has been widely 
studied for various mobility models, the optimal delay- 
throughput trade-off has not yet been established except for 
two cases of mobility models [6], [7], [12]. In this paper, 
we investigate ad-hoc networks with the two-dimensional i.i.d. 
mobility. Our main results are as follows: 

(1) For the two-dimensional i.i.d. mobility model with fast 
mobiles, we show that the maximum throughput per S-D 
pair is O (^\/D /nj under a delay constraint D. A joint 
coding-scheduling algorithm is presented to achieve the 
maximum throughput for D is both (o{y^) and o(n). 

(2) For the two-dimensional i.i.d. mobility model with slow 
mobiles, we first prove that the maximum throughput per 
S-D pair is O (^-s/D/tiJ given a delay constraint D. Then 
we propose another joint coding-scheduling algorithm to 
achieve the maximum throughput for D is both ©(1) and 
o(n). In both case (1) and (2), we need a lower bound 
on delay to ensure decodability of packets with high 
probability for large n. 

The above results can be extended to other mobility models 
as shown in a companion paper [21]. 

We also would like to mention that there is a very recent 
result by Ozgur, Leveque, and Tse [16] where they showed 
a throughput of 0(1) per S-D pair is achievable using node 
cooperation and MIMO communication; see also the earlier 
paper by Aeron and Saligrama in [1]. These schemes require 
sophisticated signal processing techniques, not considered in 
this paper 

The remainder of the paper is organized as follows: In 
Section |III1 we introduce the communication and mobility 
model. Main results along with some intuition into them are 
presented in Section |IV] Then we analyze the two-dimensional 
i.i.d. mobility models with fast mobiles in Section [V] and slow 
mobiles in Section [Vl] Finally, the conclusions is given in 
Section IVIII In the appendix, we collect some results that are 
frequently used in the paper. 

III. Model 

In this section, we first present the mobility and wireless 
interference models used in this paper. Then the definitions of 
delay and throughput are provided. 

Mobile Ad-Hoc Network Model: Consider an ad-hoc net- 
work where wireless mobile nodes are positioned in a unit 
square. Assuming the time is slotted, we study the two- 
dimensional i.i.d. mobility model in this paper, which was 
introduced in [15] and defined as follows: 

(i) There are n wireless mobile nodes positioned on a unit 
square. At each time slot, the nodes are uniformly, 
randomly positioned in the unit square. 

(ii) The node positions are independent of each other, and 
independent from time slot to time slot. So the nodes 
are totally reshuffled at each time slot. 

(iii) There are « S-D pairs in the network. Each node is both 
a source and a destination. Without loss of generality. 



we assume that the destination of node / is node / + 1 , 

and the destination of node n is node 1. 
Communication Model: We assume the protocol model intro- 
duced in [10] in this paper Let dist(/,j) denote the Euclidean 
distance between node / and node j, and r, to denote the 
transmission radius of node /. A transmission from node / can 
be successfully received at node j if and only if following two 
conditions hold: 

(i) dist(/,;) < n; 

(ii) dist{kj) > (1 + A)dist(i,y) for each node i which 
transmits at the same time, where A is a protocol- 
specified guard-zone to prevent interference. 

We further assume that at each time slot, at most W bits can 
be transmitted in a successful transmission. 
Time-Scale of Mobility: Two time-scales of mobility are 
considered in this paper 

(1) Fast mobility: The mobility of nodes is at the same 
time-scale as the data transmission, so W is a constant 
independent of « and only one-hop transmissions are 
feasible in single time slot. 

(2) Slow mobility: The mobility of nodes is much slower 
than the wireless transmission, so W ^ n. Under this 
assumption, the packet size can be scaled as W/H{n) 
for H{n) = 0{n) to guarantee //(«)-hop transmissions 
are feasible in single time slot. 

Delay and Throughput: We consider hard delay constraints 
in this paper Given a delay constraint D, a packet is said to 
be successfully delivered if the destination obtains the packet 
within D time slots after it is sent out from the source. 

Let A, [r] denote the number of bits successfully delivered to 
the destination of node / in time interval [0, T] . A throughput of 
X per S-D pair is said to be feasible under the delay constraint 
D and loss probability constraint e > if there exists «o such 
that for any n> hq, there exists a coding/routing/scheduling 
algorithm with the property that each bit transmitted by a 
source is received at its destination with probability at least 
1 — e, and 

HmPrj^^^ >A, V =1. (1) 

IV. Main Results and Some Intuition 

Recall that our objective is to maximize throughput in a 
wireless network subject to a delay constraint and a wireless 
interference constraint. More precisely, the constraints can be 
viewed as follows: 

(1) Wireless interference: Throughput is limited due to the 
fact that transmissions interfere with each other 

(2) Mobility: A packet may not be delivered to its destina- 
tion before the delay deadline since neither the packet's 
source nor any relay node may get close enough to the 
destination. 

In this section, we present some heuristic arguments to obtain 
an upper bound on the maximum throughput subject to these 
two constraints and derive the key results of the paper While 
the heuristics are far from precise derivations of the optimal 
delay-throughput trade-offs, they may be useful to the reader 
in understanding the main results. In addition, the heuristic 
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arguments provide the right order for the "hitting distance" 
(to be defined later) which plays a critical role in the optimal 
scheme used to achieve the delay-throughput trade-offs. 

Consider the two-dimensional i.i.d. mobility model with fast 
mobiles. We say that a packet hits its destination at time slot 
f if the distance between the packet and its destination is less 
than or equal to L. Under the two-dimensional i.i.d. mobility 
model, a packet hits its destination with probability nL^ at 
each time slot. So given a delay constraint D, the probability 
that a packet hits its destination in one of D time slots is 

Furthermore under the fast mobility, only one-hop transmis- 
sions are feasible at each time slot. So the transmission radius 
needs to be at least L to deliver packets to the destinations 
when their distance is L. Assume all nodes use a common 
transmission radius L and that all nodes wish to transmit at 
each time slot, then each node has l/{cinL?) fraction of time 
to transmit, and the throughput per S-D pair is no more than 
l/(cinL^) where ci is a positive constant independent of n. 
Thus the network can be regarded as a system where there 
are two virtual channels between each S-D pair as in Figure 
[T] The packets are first sent over the erasure channel with 
erasure probability 

and then over the reliable channel with rate 

1 



ciL?-n 



bits per time slot. Each source can transmit at most W bits per 
time slot on average. So in this virtual system, the maximum 
throughput of a S-D pair is 



maxmin •< W I 1 



1 



ciL'-n 



nWD 



and the corresponding optimal hitting distance L* = bi/\/nl5 
where b\ = \/c\nW . 







Pe = {\-KL-f 
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Fig. 1. Virtual-channel Representation for the Tow-Dimensional I.I.D. 
Mobility Model with Fast Mobiles 

To achieve this throughput, we first need to use the optimal 
L. Furthermore, a coding scheme achieving the capacity of 
the erasure channel is needed. Since the erasure probability 
is determined by L and D, which are different under different 
delay constraints, rate-less codes become a reasonable choice. 
The key idea in this paper is to encode data packets using 
Raptor codes, which are near optimal rate-less codes with low 
complexity. We also note that the idea of using coding to 
improve reliability of packet delivery has also been considered 



by Shah and Shakkottai in [17] for ad hoc sensor networks in 
a different context. Our first result is as follows. 
Main Result 1: Under the two-dimensional i.i.d. mobility 
model with fast mobiles, the throughput per S-D pair is 
X = O (^-^D/n^ given a delay constraint D. For D is both 
a)(^/n) and o(«), this throughput can be achieved using a joint 
coding-scheduling algorithm. 

Note that the heuristic arguments leading up to the above 
result have many flaws. For example, it suggests that one can 
wait for the source to hit the destination to deliver the packet. 
In reality, such a scheme will not work since we dehver only 
one packet to the destination during each encounter between 
the S-D pair Thus other packets at the source which are not 
delivered may violate their delay constraints. This problem 
in the heuristic argument is due to the fact that it assumes 
that we have an independent erasure channel for each packet 
despite the fact that the transmitting node is the same source. 
Despite the flaws, the heuristic argument surprisingly captures 
the delay-throughput trade-off and the optimal hitting distance 
correctly up to the right order. In practice, the bound is 
achievable by exploiting the broadcast nature of the wireless 
channel to transmit each packet to several relay nodes and 
allowing relay nodes to independently attempt to deliver the 
packet the destination. 

Next consider the two-dimensional i.i.d. mobility model 
with slow mobiles. Since multi-hop transmissions are feasible 
at each time slot, using a precise version of the result [10] 
which was obtained in [4], the maximum throughput per S-D 
pair under the slow mobility assumption is 

1 



C2L^/n' 

where c? is a positive constant independent of n. We provide 
a crude version of the argument from [10] here for ease of 
readability. Suppose each node uses a transmission radius r 
and the distance between a S-D pair is L, then each bit has to 
travel L/r hops. The number of bit-hops needed to satisfy a 
throughput requirement of X bits/slot/node in T slots is XLT/r 
Due to the interference model, the number of simultaneous 
transmissions possible in one time slot is l/(c2r^) for some 
constant C2- Thus we need 

nXLT T 

<— , 

r C2r- 



or 



X < 



C2Lrn 

Intuitively, since the total area is 1 and the number of nodes is 
«, the smallest radius of transmission that can be used while 
ensuring connectivity is given by nnP' = 1 , so 



1 



C2L^/Kn 

That this is indeed achievable in an order sense is proved in 
[4], and therefore, we take X to be l/(c2Lv^) where C2 = 
\/nc2. Then the virtual channels between a S-D pair are as 
depicted in Figure |2] In this virtual system, the maximum 
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throughput of a S-D pair is 



maxmin < W I 1 



inWD 



'niD^ where b-j 



and the optimal hitting distance L* = i 
s/c27tW. This throughput can also be achieved using a joint 
coding-scheduUng scheme. The main result is summarized as 
follows. 

Main Result 2: Under the two-dimensional i.i.d. mobility 
model with slow mobiles, the throughput per S-D pair is 
A = O (^y/ojiij given a delay constraint D. This throughput 
can be achieved using a joint coding-scheduling scheme when 
D is both co(l) and o(«). 



Source 



P. = (1-kL- 
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Reliable Channel 



Destination 



Fig. 2. Virtual-channel Representation for the Two-Dimensional I.I.D. 
Mobility Model with Slow Mobiles 

As stated before, the crude virtual channel representation 
used in this section surprisingly yields the correct results. 
However, they do not form the basis of the proofs in the rest 
of the paper. Several assumptions have been made in deriving 
the virtual channel representation: 

(i) The hitting events for various packets are assumed to 
independent which is difficult to ensure since the same 
node may act as a relay for multiple packets. 

(ii) It assumes a fixed hitting distance which is not reason- 
able to obtain an upper bound on the throughput. An 
upper bound must be scheme-independent. 

In view of these limitations, we use the virtual channel model 
to only provide some insight into the results and the hitting 
distance we should use in the achievable algorithms, but 
rigorous proofs of the main results are provided in subsequent 
sections. 

v. two-dlmensional i.i.d. mobility model, fast 
Mobiles 

In this section, we investigate the two-dimensional i.i.d. mo- 
bility model with fast mobiles. Assuming that all mobiles have 
wireless communication and coding capability, we investigate 
the maximum throughput the network can achieve by using 
relaying and coding to recover packet loss as discussed in the 
heuristic arguments. Given a delay constraint D, we will first 
prove that the maximum throughput per S-D pair which can be 
supported by the network is O (^^/D/nj . Then a joint coding- 
scheduling scheme will be proposed to achieve the maximum 
throughput when D is both co{^) and o(«). 

A. Upper Bound 

In this subsection, we show the maximum throughput the 
network can support without network coding, i.e., under the 
following assumption. 



Assumption 1: Packets destined for different nodes cannot 
be encoded together Further, we assume that coding is only 
used to recover from erasures and not for data compression. 
Specifically we assume that at least k coded packets are 
necessary to recover k data packets, where all packets (coded 
or uncoded) are assumed to be of the same size. 

Assumption 1 is the only significant restriction imposed 
on coding/routing/scheduling schemes. We also make the 
following assumption. 

Assumption 2: A new coded packet is generated right before 
the packet is sent out. The node generating the coded packet 
does not store the packet in its buffer. 

Assumption 2 is not restrictive since the information con- 
tained in the new packet is already available at the node. 
Assumption 3: Once a node receives a packet (coded or 
uncoded), the packet is not discarded by the node till its 
deadline expires. 

Assumption 3 is not restrictive since we are studying an 
upper bound on the throughput in this section. 

Next we introduce following notations which will be used 
in our proof. 

• b : Index of a bit stored in the network. Bit b could be 
either a bit of a data packet or a bit of a coded packet. If a 
node generates a copy of a packet to be stored in another 
node, then the bits in the copy are given different indices 
than the bits in the original packet. 

• dj, : The destination of bit b. 

• ct, : The node storing bit b. 

• tj, : The time slot at which bit b is generated. 

• 5/, : If bit b is delivered to its destination, then S/, is the 
transmission radius used to deliver b. 

• ^[T] : The set of all bits stored at relay nodes at time 
slot T. We do not include bits that are still in their source 
node in defining ^[T]. 

. A[T]: A[T]=Z'UMT]- 

Assume that the delay constraint is D, and a data packet is 
processed by the source node at time slot tp. Then the data 
packet is said to be active from time slot fp to + D — 1 . A 
bit b is said to be active if at least one data packet encoded 
into the packet containing bit b has not expired. It is easy to 
see that any bit expires at most D time slots after the bit is 
generated. Also a bit is said to be good if it is active when 
delivered to its destination. Now let A[r] denote the number 
of good bits delivered to destinations in [0,r]. Without loss of 
generality, we assume good bits are indexed from 1 to A[T]. 
Note that expired bits might help decode good source bits but 
would not contribute to the total throughput, so we have 



A[T]>A[T], 



(2) 



where A[T] is the number of good source bits successfully 
recovered at destinations. 

Next we present three fundamental constraints. In the fol- 
lowing lemma, inequalities (O and (HI hold since the total 
number of bits transmitted or received in T time slots cannot 
exceed nWT. Inequality ^ holds since under the protocol 
model, discs of radius Ar,/2 around the receivers should be 
mutually disjoint from each other 



5 



Lemma 1: For any mobility model, the following inequali- 
ties hold, 



A[T] < nWT 
WAT]\ < nWT 
WT 



6=1 



16 - . 



< 



(3) 
(4) 

(5) 



where \^[T] \ is the cardinality of the set £^[T]. 

Proof: Since each node can transmit at most W bits 
per time slot, the total number of bits transmitted in T time 
slots is less than nWT which implies inequalities (O and (01). 
Inequality (|5]i was proved in [2]. ■ 

We first consider the scenario where packet relaying is not 
allowed, i.e., packets need to be directly transmitted from 
sources to destinations. In the following lemma, we show that 
the throughput in this case is at most 0(1 /\/«) even without 
the delay constraint. 

Lemma 2: Consider the two-dimensional i.i.d. mobility 
model with fast mobiles. Suppose that packets have to be 
directly transmitted from sources to destinations, then 



8v/2 



WT^/7l>E[^.[T]]. 



(6) 



Proof: First from the Cauchy-Schwarz inequality and 
inequality (|5]), we have 




< 



\b=\ / \b=\ 



which implies 
E 



m ' 

b=l 




A[T] 



(7) 



This gives an upper-bound on the expected distance travelled. 
Next we bound the total number of times that each mobile gets 
within a distance L of its destination for L e [0, 1/2] . From the 
i.i.d. mobility assumption, we have that for any /, j and f, 



Pridisl{iJ){t)<L)^7:L\ 



which implies 

T I 



E E ldist(,,((( 



t=\ \i=l 



+ 1) modn))(r)<L 



KL^nT. 



Since at most W bits can be transmitted at each time slot, we 
further have 

A[r] T n 

E ^h<L - ^E E ldist((,((/+l) moin)){t)<L- 
b=\ t=\i=\ 

Taking expectation on both sides of above inequality, we 
obtain 



E [A[T]] -E 



"m 

E ^St>L 

b=l 



< WnL'-nT. 



(8) 



Now using Jensen's inequality and inequalities O and ([8]), 
we can conclude that 



E [K[T\ 



> 





/lewT 




' 7rA2 




m ] 


E 


b=\ 






\m 


LE 


Eu, 

b=l 



m 



> L{E [A[T]] - WnLW) . (9) 
Note that inequality (|9]l holds for any LE [0, 1/2]. We choose 



' E[A{T]] 
iKnWT ' 



which is less than 1/2 since A\T] < nWT. Substituting L* into 
inequality we have 



[A[T]] 

which implies that 

8V2 



> -L*E[A[T]] 



WT^ > E [A[T]] . 

The lemma then follows from inequality (|2]i. ■ 
Next we investigate the maximum throughput the network 
can support using coding/routing/scheduling schemes. We 
have obtained an upper bound on the number of bits directly 
transmitted from sources to destinations in Lemma|2l To bound 
the maximum throughput with relaying, we will calculate the 
number of bits transmitted from relays to destinations in the 
following analysis. 

Theorem 3: Consider the two-dimensional i.i.d. mobility 
model with fast mobiles, and assume that Assumption 1-3 
hold. Then given a delay constraint D, we have that 

5^V^fVz5+0>£[A[r]]. (10) 



Proof: In the proof of the theorem, we treat active bits 
at relays and active bits at sources differently since we can 
bound the number of active bits at relays using inequality (|4|l, 
while the number of active bits at sources could be larger Let 
A''\T] denote the number of good bits delivered directly from 
relays to destinations in[0,r]. Without loss of generality, we 
assume these good bits are indexed from 1 to A''[r]. Similar 
to inequality (|7|i, we first have 



A'[r] • 

b=\ 



< 



16WT \ 
1^) 



A'[T] 



(11) 



Let Lt, denote the minimum distance between node dt, and 
node ct, from time slot th to time slot tj,+D—l, i.e., 

Lf, = min dist(ii;,,c;,)(f). 

tb<t<tb+D-l 

Then for any L E [0, 1/2] and any bit b E ^^[T], we have 
Pr (Lf, < L) = 1 - (1 - TzL^f < kL^D, 
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which implies 



< nWTnU-D. 



Furthermore, we have 

V[r] 



h= 

which implies that 
E 



h=l be,i?[T] 



A'[T] 
h=\ 



< nWTnU-D. 



(12) 



Thus we can conclude that 



h=\ 



> E 



> LE 



A'-IT] 

E SbiSi,>L 
b=l 

E ^Sb>L 

h=l 



> L{E[A''[T]]-E 



IA'[T] 

E ^Si,<L 
_b=l 

> LE[A''[T]]-nWT7zL^D, 



where the last inequality follows from inequality (fT2l) . 

Now using Jensen's inequality and inequaUties ( fTTT ) and 
( flJl l, we have that for any L G [0, 1/2], 



[A'-[r]] > LE[A''[T]]~nWTnL^D. 



Substituting 



E [A''[T] 



V InWTnD 
into inequality ( fT4b . we can conclude that 

8V2WT r-pj^^^rATT^il 
T VnD>E[A [T]\. 



The theorem follows from inequalities (|6]l, (flST l and Q. ■ 
From Theorem [3j we can conclude that the throughput per 
S-D is 0{y^D/n) given a delay constraint D. 

B. Joint Coding-Scheduling Algorithm 

In Section HV] we motivated the need to first encode data 
packets. In this subsection, we use Raptor codes and propose 
a joint coding-scheduling scheme to achieve the maximum 
throughput obtained in Theorem [3] 

Motivated by the heuristic argument in Section IIVI we 
divide the unit square into square cells with each side of length 
equal to 1 / ^fnD, which is of the same order as the optimal 
hitting distance. In our scheme, we will allow final delivery 
of a packet to its destination only when a relay carrying the 
packet is in the same cell as the destination. Thus, a packet 
is delivered only when the relay and destination are within a 
distance of ^/2/ \/nD, which is also the same as the hitting 
distance calculated in Section |IV] except for a constant factor 
which does not play a role in the order calculations. The 



mean number of nodes in each cell will be denoted by M 
and is equal to yfnJT). The transmission radius of each node 
is chosen to be V2/ \fnD so that any two nodes within a cell 
can communicate with each other This means that, given the 
interference constraint, two nodes in a cell can communicate 
if all nodes in cells within a fixed distance from the given cell 
stay silent. Each time slot is further divided into C mini-slots 
and each cell is guaranteed to be active in at least one mini- 
slot within each time slot. Assume C = 9. The reason we use 
nine mini-slots is that if a node in a cell is active, then no 
other nodes in any of its neighboring eight cells can be active, 
but nodes outside this neighborhood can be active. Further, we 
denote the packet size to be W/ (2C) so that two packets can 
be transmitted in each mini-slot. 

A cell is said to be a good cell at time t if the number of 
nodes in the cell is between 9M/10+ 1 and UM/IO. We also 
define and categorize packets into four different types. 

• Data packets: There are the uncoded data packets that 
have to be transmitted by the sources and received by the 
destinations. 

• Coded packets: Packets generated by Raptor codes. We 
let (i,^) denote the A:* coded packet of node /. 

• Duplicate packets: Each coded packet could be broadcast 
to other nodes to generate multiple copies, called dupli- 
cate packets. We let {i,k,j) denote a copy of {i,k) carried 
by node j, and {i,k,J) to denote the set of all copies of 
coded packet {i,k). 

■ DeUverable packets: Duplicate packets that happen to be 

within distance L from their destinations. 
We now describe our coding/scheduling algorithm. 
Joint Coding-Scheduling Scheme I: We group every 6D time 
slots into a super time slot. At each super time slot, the nodes 
transmit packets as follows. 
(1) Raptor Encoding: Each source takes 6D/{25M) data 
packets, and uses Raptor codes to generate D/M coded 
packets. 

(15) (2) Broadcasting: This step consists of D time slots. At 
each time slot, the nodes executes the following tasks: 



(13) 



(14) 



(3) 



(i) In each good cell, one node is randomly selected. 
If the selected node has not already transmitted 
all of its D/M coded packets, then it broadcasts a 
coded packet that was not previously transmitted 
to 9M/10 other nodes in the cell during the mini- 
slot allocated to that cell. Recall that our choice of 
packet size allows one node in every good cell to 
transmit during every time slot. 

(ii) All nodes check the duplicate packets they have. 
If more than one duplicate packets have the same 
destination, select one at random to keep and drop 
the others. 

Receiving: This step consists of 5D time slots. At 
each time slot, if a cell contains no more than two 
deliverable packets, the deliverable packets are delivered 
to their destinations using one-hop transmissions during 
the mini-slot allocated to that cell. At the end of this 
step, all undelivered packets are dropped. The destina- 
tions decode the received coded packets using Raptor 
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decoding. 

Note that in describing the algorithm, we did not account for 
the delays in Raptor encoding and decoding. However, Raptor 
codes have linear encoding and decoding complexity. Hence, 
even if these delays are taken into account, our order results 
will not change. 

Theorem 4: Consider Joint Coding-Scheduling Algorithm 
I. Suppose D is both (£){^/n) and o(n), and the delay constraint 
is 6D. Then given any e > 0, there exists no such that for any 
n > tiQ, every data packet sent out can be recovered at the 
destination with probability at least 1 — £, and furthermore 



(16) 



Proof: Let denote the f* super time slot. For each 
super time slot, the proof will show that the following events 
happen with high probability. 

Broadcasting: At least 16D/(25M) coded packets from a 
source are successfully duplicated after the broadcasting step 
with high probability, where a coded packet is said to be 
successfully duplicated if the packet is in at least 4M/5 distinct 
relay nodes. Letting Ai[ts] denote the number of coded packets 
which are successfully duplicated in super time slot f^ , we will 
first show that there exists «i such that for any n > «i, 

>l;[f.v] > I > 1 -3e"5mm. (17) 




Pr 



25 M 



Receiving: At least 1D/{25M) distinct coded packets from 
a source are delivered to its destination after the receiving 
step with high probability. Letting B, [fs] denote the number of 
distinct coded packets delivered to destination / + 1 in super 
time slot f^, we will show there exists 112 such that for all 
n > n2, 



Pr 



ID 

^ ^ - 25M 



Ai\t,]>—- I >l-2e- 
^ ^ - 25M 



D 

WM 



(18) 



Decoding: The 6D/25M data packets from a source are 
recovered with high probability. Letting S'i[ts] denote the event 
such that all 6D / (25M) data packets are fully recovered, we 
will show that 



for some a > 0. 
Recall that M = 



Bi[ts] > 



7 D 
25M 



> 1 



(19) 



M y 

lim — = lim — 



= 0. 



D 



Combining inequaUties (fT7]i-(fT9]l, we can conclude that for any 
e < 1/19, there exists «o > max{«i,n2} such that for n > «o, 

Pr(^[f.J)> 1-e, (20) 

which implies that every data packet sent out can be recovered 
with probability at least 1 — e. Since 1— e> 18/19, from the 
Chernoff bound (see Lemma [TT] provided in the Appendix C 
for convenience), we can conclude that for « > no, 

p4tui>s]>^T,]>l-e-^, 



where we choose 5 = 1/20 in Lemma [T2] Note that 
l^,[f,] > TO^v implies at least 



9 6D 

— r,. X — 

10 25M 



W 



27W DT, 27W 



2C 250C M 



250C 



bits are successfully transmitted from node / to node i+l in 
Ts super time slots. Since each super time slot consists of 6D 
time slots, we can conclude that for n > no. 



/ 27W D \ 

P^(^'[^^^^]^ 250C^W^"'j^^ 

which implies that, for a fixed n > no, 

Um Pr > W- 

T^oo \ T - 500C V n 




_ 7:1 
ne ™, 



1. 



To complete the proof, we now show inequalities dTTl ). dTsl ) 
and ([Ull. 

Analysis of broadcasting: Let ^, [f] denote the event that 
node ; broadcasts a coded packet at time slot t. So ^, [f] occurs 
when following two conditions hold: 

(i) The cell node / in is a good cell; 

(ii) Node / is selected to broadcast. 

Since the nodes are uniformly randomly positioned, from the 
Chernoff bound we have 

Pr(^,-[f])> 



IIM 



M 

1 -2e"™ 



which implies that there exists ni such that for any n > ni, 

1 



Pr(^,[?])> 



9M 



Then from the Chernoff bound again, we have 



4D\ 



(21) 



for n > ni . Thus, with a high probability, more than 4D/ (5M) 
coded packets are broadcast, and each broadcast generates 
9M/10 copies. 

Duplicate packets might be dropped at step (ii) of the 
broadcasting step. We next calculate the number of duplicate 
packets of node / left after the broadcasting step. Assume node 
/ broadcasts 5, coded packets, so 5, <D/M. Then the number 
of duplicate packets left after the broadcasting step is the same 
as the number of nonempty bins of following balls-and-bins 
problem, where the bins represent the mobile nodes other than 
node ;, and the balls represent the duplicate packets broadcast 
from node /. 

Balls-and-Bitis Problem: Assume we have (n— 1) bins. At 
each time slot, we select 9M/10 bins and drop one ball in 
each of them. Repeat this 5,- times. 

Using A^i to denote this number, from Lemma [12] in Ap- 
pendix C, we have 

Pr(M > (l-5)(n-l)pi) > l-2e-^'("-l)Pl/^ 



where 



= 1-. 



9D;M 
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Using the fact 1 — e * > x — for any x > 0, we get 



{n-l)pi 



<)DiM 
lOn-lO 



> 



> 



{n-l)[l- 
9D,M 81D?m2 



10 
44 - 

49^'^' 



100«- 100 



where the last inequality holds for « > n2 for some «2 since 
D,M < Z) = o(«). Thus choose 5 = 1/50 and we can conclude 
for « > n2, 



/ 22 
Pr hVi > 25^i^ 



El.^,M=5,- >1 



-2e"-nm5. (22) 



r=l 



Recall a coded packet is said to be successfully duplicated if 
it has at least 4M/5 copies at the end of the broadcasting step. 
Inequality (|22] | implies for n > n2, 




A > 1 



since otherwise, less than 22Z),M/25 duplicate packets are left 
in the network. Thus we can conclude that for n>h2, 



Pr 



25 M 



> 1 — 2e ^OTD . 



(23) 



Letting «i = max{ni,n2}, inequality (fTTI i follows from in- 
equalities (l2Tl i and (|23] | for n>n\. 

Analysis of receiving: Assume coded packets 
{(/, 1), . . . , (i, 16D/(25M))} are successfully duplicated. 
We let i^(,jt) [t] denote the event that coded packet {i, k) is 
delivered at time slot t. Then ^(,jt)[f] will definitely occur if 
both the following conditions hold: 

(i) One and only one duplicate packet of {i,k) becomes 
a deliverable packet. Let S!j^.j^-^[t] denote this event. 
Assume the duplicate packet is {i,k,j), i.e., node j 
contains packet {i,k). 

(ii) There are no other deliverable packets in the cell con- 
taining node j except packet {i,k,j) and one possible 
duplicate packet to node j carried by node / + 1 . Let 



f denote this event. 



Note that duplicate packets of node ; are carried by different 
nodes, and their mobilities are independent. Now assume there 
are M(,jt) copies of {i,k) in the entire network, then 



Note that (1 - M/n) 



M 
n 



(i.k)- 



> 1 



■(%-i)-l)^/n, andM(,;^) 



> 



4M/ if [i^k) is successfully duplicated. So for a successfully 
duplicated packet, there exists «3 such that for any « > na, 



Pr 



> 



7M2 

To^' 



Suppose we have M nodes in the cell containing node j, 
from the Chernoff bound, we have 

11 \ M 

Pr ( M< —M I > 1 -e"3ro. 



Note that condition (ii) is equivalent to the following event: 
Given node j and node ; + 1 in the cell, no more deliverable 
packets appear when we put another M— 2 nodes into the cell. 
Now given K nodes in the cell, the probability that no more 
deliverable appears when we put another node is at least 



2KD 

n-K 



This holds due to the following two facts: 

(a) The new node should not be the destination of any 
duplicate packets already in the cell (there are at most 
KD duplicate packets already in the cell). 

(b) The duplicate packets carried by the new node are not 
destined for any of the existing K nodes. Note that each 
source has no more than D duplicate packets, so there 
are at most KD nodes which carry the duplicate packet 
towards the K existing nodes. 

Note that lim„^ooM = ex., so there exists «4 such that for any 
n > n4, 



Pr 



> 



> 



> 



1 



M 



M 



TT-rrr 



2KD 



n-K 



22MD 
lOn- IIM 



UM 



3 

IT' 



So we can conclude that for any n > max{n3,«4}, 

21M2 21 



Pr(%i)[r])> 



HOD' 

a successfully duplicated 



110« 

which implies at each time slot, 
packet {i,k) will be delivered with probability at least 
21/(1 lOD). Note at each time slot, only one coded packet 
can be delivered to the destination of node ;. So the number 
of distinct coded packets delivered to the destination of node 
/ is the same as the number of nonempty bins of following 
balls-and-bins problem, where the bins represent the distinct 
coded packets, the balls represent successful deliveries, and 
a ball is dropped in a specific bin means the corresponding 
coded packet is delivered to the destination. 

Balls-and-bins Problem: Suppose we have 16D/(25M) bins 
and one trash can. At each time slot, we drop a ball. Each bin 
receives the ball with probability 21/(1 lOD), and the trash 
can receives the ball with probability 1 — p, where 

21 16D _ 168 1 
HOD ^ 25M ~ 1375 M' 
Repeat this 5D times, i.e., 5D balls are dropped. 

Let ^^^2 denote nonempty bins of the above balls-and-bins 
problem and choose 5 = 1/6. From Lemma [T2l in Appendix 
C, we have 



Pr 



7 D 

N2> 

^ - 25M 



> 1 - 2e 



D 



and inequality (fTsl l holds for « > «2, where «2 = max{n3,n4}. 

Analysis of decoding: Inequality ( fT9l i follows from Lemma 
|9] on the error probability of Raptor codes provided in Ap- 
pendix A. ■ 
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vi. two-dlmensional i.i.d. mobility model, slow 
Mobiles 

In this section, we investigate the two-dimensional i.i.d. 
mobility model with slow mobiles. Given a delay constraint 
D, we first prove the maximum throughput per S-D pair 
which can be supported by the network is O (^-^/D/nj . Then 
a joint coding-scheduling scheme is proposed to achieve the 
maximum throughput. 

A. Upper Bound 

Let 4 denote the time slot in which bit b is delivered to 
its destination. Under slow mobiUty, the delivery in 4 could 
use multi-hop transmissions, so we further define following 
notations: 

• Hi, : The number hops bit b travels in time slot 4. 

• Lh : The Euclidean distance bit b travels in time slot 4. 

• 5^ : The transmission radius used in hop h for 1 <h<Hi,. 

Similar to Lemma [T] we have following results. 

Lemma 5: For any mobility model, the following inequali- 
ties hold. 



A[r] H,, 



k[T\ H,, 

^ ^ 1 < nWT 

h=\ h=\ 



b=\ h=l 



(24) 

(25) 

□ 

Similar to the fast mobility cases, we first consider the 
throughput under the assumption that the packets can only 
be delivered to destinations from sources. 

Lemma 6: Consider the two-dimensional i.i.d. mobility 
model with slow mobiles. Suppose that packets have to be 
directly transmitted to destinations from sources, then 



3/2 



7i>E[A[T]] 



(26) 



Proof: First from the Cauchy-Schwartz inequality and 
Lemma |5] we have that 



V 6=1/1=1 / 



/A[T] H, \ /k[T] Hu 2^ 

\b=\h=l J \b=\h=i 

m H,, 7 

h=ih=i 



< 



which implies 



since La'Ii ■S''; > L^. The rest of the proof is same as the 
proof of Lemma |2] ■ 



Theorem 7: Consider the two-dimensional i.i.d. mobility 
model with slow mobiles, and assume that Assumption 1-3 
holds. Then given a delay constraint D, we have 

4^WT ,,,, 



3/A 



1 >£[A[r]]. 



(27) 



Proof: Similar to the proof of Theorem [3] ■ 
From Theorem |7] we can conclude that the throughput per 
S-D is 0{\/D/n) given a delay constraint D. 

B. Joint Coding- Scheduling Algorithm 

In this subsection, we propose a joint coding-scheduling 
scheme to achieve the throughput suggested in Theorem |7] In 
the receiving step, we divide the unit square into square cells 
with each side of length equal to l/v^nD^, which is of the 
same order as the optimal hitting distance obtained in Section 
HV] The mean number of nodes in each cell will be denoted 
by M2 and is equal to ^l^njD. The packet size is chosen to 
be 

low 



llc,CVM2 

so that at each time slot, all nodes in a good cell can transmit 
one packet to some other node in the same cell by using the 
highway algorithm proposed in [4] (see in Appendix B), where 
Cy is a constant independent of n. In the broadcasting step, the 
unit square is divided into square cells with each side of length 
equal to 1 jyj rP-D. The mean number of nodes in each will be 
denoted by M\ and is equal to IJnjD. In the broadcasting 
step, the transmission radius of each nodes is chosen to be 
V2v^ n^D. Note the packet size is 

low _ low 

So in the broadcasting step, all nodes in a good cell could be 
scheduled to broadcast one coded packet at one min-slot. Also 
note that MiM2D/n = 1. 

Joint Coding-Scheduling Algorithm II: We group every 16D 
time slots into a super time slot. At each super time slot, the 
nodes transmit packets as follows: 

(1) Raptor Encoding: Each source takes 2D/5 data pack- 
ets, and uses Raptor codes to generate D coded packets. 

(2) Broadcasting: The unit square is divided into a regular 
lattice with n/M\ cells. This step consists of D time 
slots. At each time slot, the nodes execute the following 
tasks: 

(i) In each good cell, the nodes take their turns to 
broadcast a coded packet to 9Mi/10 other nodes 
in the cell. We use the same definition of a good 
cell as in Algorithm I, i.e., the number of nodes in 
a good cell should be with a factor of the mean, 
where the factor is required to lie in the interval 
[0.9,1.1]. 

(ii) All nodes check the duplicate packets they have. 
If more than one duplicate packet is destined to a 
same destination, randomly keep one and drop the 
others. 
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(3) Receiving: The unit square is divided into a regular 
lattice with n/M2 cells. This step consists of 15D time 
slots. At each time slot, the nodes in a good cell execute 
the following tasks in the mini-slot allocated to that cell. 

(i) Each node containing deliverable packets randomly 
selects a deliverable packet, and sends a request to 
the corresponding destination. 

(ii) Each destination only accepts one request and 
refuses the others. 

(iii) The nodes whose requests are accepted transmit 
the deliverable packets to their destinations using 
the highway algorithm proposed in [4]. 

At the end of this step, all undelivered packets are 
dropped. Destinations use Raptor decoding to obtain the 
source packets. Note that one requires some overhead 
in obtaining route to the destination to perform step 
(3)(i) above. As in previous works, we assume that this 
overhead is small since one can transmit many packets 
in each time slot, under the slow mobility assumption. 
Theorem 8: Consider Joint Coding-Scheduling Algorithm 
II. Suppose D is both fi)(l) and o(«), and the delay constraint 
is 16D. Then given any e, there exists «o such that for any 
n > no, every data packet sent out can be recovered at the 
destination with probability at least 1 — £, and furthermore 



(28) 



Proof: The proof will show that the following events 
happen with high probability. 

Broadcasting: At least 4D/5 coded packets from a source are 
successfully duplicated after the broadcasting step with high 
probabihty, i.e.. 




Pr ( A,[fv] >^d] > 1 - Be^™? , 



(29) 



where a coded packet is said to be successfully duplicated if 
the packet is in 4Mi/5 distinct relay nodes. 
Receiving: At least D/2 distinct coded packets from a source 
are delivered to its destination after the receiving step with 
high probability, i.e.. 



Pr B,W> 



D 



Ai[t,] > -D 



> 1 



p 



(30) 



After obtaining inequality ( |29l ) and ( l30l l. the theorem can 
be proved by following the argument in Theorem HI 

Analysis of broadcasting: Similar to the analysis of in- 
equality (1% . 

Analysis of receiving: Assume that coded packets 
(/, 1), . . . , (i,4D/5) are successfully duplicated. Note that du- 
plicate packets from a common source are carried by different 
nodes; and the mobilities of those nodes are independent. So 
{i,k) will be definitely delivered to its destination if both the 
following conditions hold: 
(i) A copy of {i,k) is the only deliverable packet for 
destination / +! in time slot t. Let f^^'. ^^[r] denote this 
event; and assume that the dupUcate packet is in node 
j- 



(ii) Node j has no other deUverable packet, and the cell 
containing node j is good. Let S''^- ^.j [f ] denote this event. 

Let M(, J.) denote the number of copies of {i,k). Since each 
source has at most 9MiD/10 dupUcate packets in the network, 
we have 



> 



It is easy to verify that 



lim I 1 

n 



.-0.9 



Thus, for successfully duplicated packet {i,k), i.e., M{i^k) ^ 
4Mi/5, we can conclude that 



> 



4M1M2 

5 en 



(31) 



holds for sufficiently large n. Note that each node carries at 
most DMi duplicate packets, so we further have 



Pr 



DMi-l 



(32) 



where (1 -Mi/n) 



DMi-\ 



is the lower bound on the probability 



that all packets in node j except {i,k,j) are undeliverable, and 

1 — 2e^3ro is the probability that the cell is good. 

From inequalities OTT ) and ( |32] ). we can conclude that for 
sufficiently large «, 

= Pr( 

^ M1M2 

- 12n 
1 

IZD" 

Inequality ( [3Q] l can be proved by the balls-and-bins argument 
used to show inequality ( fTSl l. ■ 

VII. Conclusion 

In this paper, we investigated the optimal delay-throughput 
trade-off in ad-hoc networks with two-dimensional i.i.d. mo- 
bility models. For the two-dimensional i.i.d. mobility model 
with fast mobiles, the optimal trade-off was shown to be 
X ~ & (^-^D/n^ when D is both ©(-y/n) and o(«). For the 
two-dimensional i.i.d. mobility model with slow mobiles, the 
optimal trade-off was shown to be A = ^ y^D/tij when D 
is both fi)(l) and o(«). 

We now briefly comment on the conditions that we have 
imposed on the delay requirement to obtain the optimal delay- 
throughput tradeoffs. In the slow mobility case, we have 
assumed that the required delay has to be (b(1). This condition 
on the delay is used to allow the decoding error probability to 
go to zero as « °o. If we allow a small probability of loss 
(it can be arbitrarily small), then one can allow the delay to 
be 0(1). We have also assumed that the delay is o(«). This is 
not really a restriction since it is easy to see from prior work 
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that the best achievable throughput of 0(1) is obtained when 
the delay is co{n) [9]. The o(n) condition on delay is used in 
our paper only to ensure that our cell partitioning, scheduling 
and coding strategy works. 

In the case of fast mobiles, when D is (9(-y/«), then the 
number of packets that can be transmitted in D time slots is 
a constant and hence one cannot use coding to ensure that 
the probability of packet loss is arbitrarily small. In this case, 
one can obtain a bound that is a logarithmic factor smaller 
than the upper bound using packet replication techniques as 
has been done in [11] for the slow-mobile case. However, 
the best achievable lower bound is unknown. Again the o(n) 
requirement is not significant since a throughput of 0(1) can 
be achieved if the delay requirement is larger [9]. 
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Appendix A: Raptor Codes 
A. Raptor Codes 

Raptor codes are low-complexity, near-optimal rate-less 
codes for erasure channels. It was proposed by Shokrollahi 
in [19], and the following result was presented in [19]. 

Lemma 9: The receiver can correctly decode the M data 
packets with probability at least 1 — 1/(M"''^') for some 
fl(e) > after it obtains (1 + e)M coded packets generated 
by Raptor codes. The number of operations used for encoding 
and decoding is 0{M). 

□ 

Appendix B: Throughput of Static Wireless 
Networks 

The throughput of a random wireless network with n static 
nodes and n random S-D pairs is introduced by Gupta and Ku- 
mar [10]. They showed that the maximum throughput per S-D 
pair is 0(1/ y/n), and proposed a scheduling scheme achieving 
a throughput of 0(1/ y/nlogn) per S-D pair This log« gap was 
latter closed by Franceschetti et. al in [4] where they showed a 
throughput of 0(l/-y/«) per S-D pair is achievable. The result 
is obtained under the physical interference models. However, 
it can be easily extended to the protocol model by using the 
same algorithm. 

Lemma 10: In a random wireless network with n static 
nodes and n S-D pairs, a throughput of 

bits/time-slot per S-D pair is achievable, where 
c.v is a positive constant independent of «. 

□ 

Suppose the nodes use a common transmission radius r = 
0(1 /«). The key idea of [4] is to construct 0(«) disjoint paths 
traversing the network vertically and horizontally. These paths 
are called highways in [4], and a throughput of 0(1/-^) per 
S-D pair is achievable by transmitting data throughput these 
highways. We call this algorithm a highway algorithm in this 
paper. 

Appendix C: Probability Results 

In this appendix, we present some standard results in 
probability for the reader's convenience. In addition, we also 
present some variations of standard results which do not seem 
to be available in any book to best of our knowledge. 

The following lemma is a standard result in probabiUty, 
which we provide here for convenience. 
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Lemma 11: Let Xi,...,X„ be independent 0—1 random 
variables such that = jU. Then, the following Chernoff 
bounds hold 



Pr|^£x,<(l-5)Mj < 
Pr( 52X, > (1 + 5)m ) < e-^^i"!^ 



(33) 
(34) 



Proof: A detailed proof can be found in [14]. ■ 
The next lemmas are variations of standard balls-and-bins 
problems. However, we have not seen the results for the 
particular variation that we need in this paper. So we present 
the lemmas along with brief proofs below. 

Lemma 12: Assume we have m bins. At each time, choose 
h bins and drop one ball in each of them. Repeat this n times. 
Using A^i to denote the number of bins containing at least one 
ball, the following inequality holds for sufficiently large «. 



Pr(A?i < (1 - 5)mpi) < 2e-^~"'P^I\ 



(35) 



where p\ = l —e^i^. 

Proof: At each time, bin ; receives a ball with prob- 
ability li/m. We let k", denote the number of balls in bin /. 
Now consider a related balls-and-bins problem where the ball 
dropping procedure is replaced by a certain number of trials 
as dictated by a Poisson random variable. Specifically, define 
« to be a Poisson random variable with mean «, and repeat 
the ball dropping procedure « times. Let kj denote the number 
of balls in bin / in this case. It is easy to see that {S",} are 
i.i.d. Poisson random variables with mean nh/m. So we can 
conclude 

Pr(M<(l-5)m/5i) = Pr[f;i^,>i<(l-5)mpij 



< Pr £ls,>i<(l-5)m/5, 



i=l 



n > n 



< 



pr(iriii^,>i<(i- 

Pr(n > «) 
/ m ^ 
2Pr £l^,>i<(l-5)m/3i 



Since 



Pr(ls,>i = 1) =Pr()f, > 1) = 1 - e-^ =/7i, 
from Lemma [TT] we have 

Pr(M < (l-5)mpi) < 2e-^^'"P'/\ 



containing at least 1 ball, the following inequality holds for 
sufficiently large n. 



Pr(A?2 < (l-5)mp2) < 2e-^'""'2/3; 



(36) 



where p2 — I — e^~ . 

Proof: Let K", denote the number of balls in bin /. Next 
define « to be a poisson random variable with mean n. We 
consider the case such that n balls are independently dropped 
in m bins. Using K", to be number of balls in bin / in this case, 
it is easy to see that {jq} are i.i.d. poisson random variables 
with mean — . 

in 

Now given n/,, we first have 



Pr(A^2 < il-8)mp2 



\i=l 



< 



Pr(^£ljc,>i<(l-5)mp, 
Pr(Lr=il'c,>i <(l-5)m/52 



n> n 



Pr(« > «) 



Since 

Pr(U,>i = l)=Pr(>f,>l) = l 
from Lemma nn we have 



P2, 



Pr(i:iic,>i<(l-5)m/52j <e-^-"^2/3. 



which implies for sufficiently large n, 

Pr(A^2 < (1 - d)mp2) < V37me-^''"P^/^ 



The above idea of using a Poisson number of trials to bound 
the probability of the occurrence of an event in a fixed number 
of trials is called the Poisson heuristic in [14]. 

Lemma 13: Suppose n balls are independently dropped into 
m bins and one trash can. After a ball is dropped, the 
probability in the trash can is I ~ p, and the probability in 
a specific bin is p/m. Using N2 to denote the number of bins 



